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Abstract 

Define at to be the smallest positive multiple of k such that the sum of its digits in 
base q is equal to k. The asymptotic behavior, lower and upper bound estimates of au 
are investigated. A characterization of the minimality condition is also considered. 



1 Motivation 

A positive integer n is a Niven number (or a Harshad number) if it is divisible by the sum of 
its (decimal) digits. For instance, 2007 is a Niven number since 9 divides 2007. A q-Niven 
number is an integer k which is divisible by the sum of its base q digits, call it s q {k) (if 
q = 2, we shall use s(k) for S2(k)). Niven numbers have been extensively studied by various 
authors (see Cai [3], Cooper and Kennedy [4j, De Koninck and Doyon [5], De Koninck, 
Doyon and Katai [6], Grundman [7J, Mauduit, Pomerance and Sarkozy [11], Mauduit and 
Sarkozy [12], Vardi [16] , just to cite a few of the most recent works). 

In this paper, we define a natural sequence in relation to g-Niven numbers. For a fixed 
but arbitrary k 6 N and a base q > 2, one may ask whether or not there exists a g-Niven 
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number whose sum of its digits is precisely k. We will show later that the answer to this is 
affirmative. Therefore, it makes sense to define at to be the smallest positive multiple of k 
such that s q (ak) = k. In other words, a& is the smallest Niven number whose sum of the 
digits is a given positive integer k. We denote by Ck the companion sequence Ck = a^/k, 
k G N. Obviously, o^, respectively, c^, depend on q, but we will not make this explicit to 
avoid cluttering the notation. 

In this paper we give constructive methods in Sections [31 0] and [7] by two different 
techniques for the binary and nonbinary cases, yielding sharp upper bounds for a^. We find 
elementary upper bounds true for all k, and then better nonelementary ones true for most 
odd k. 

Throughout this paper, we use the Vinogradov symbols S> and -C and the Landau 
symbols O and o with their usual meanings. The constants implied by such symbols are 
absolute. We write x for a large positive real number, and p and q for prime numbers. If 
A is a set of positive integers, we write A(x) = An [l,x\. We write In a; for the natural 
logarithm of x and log a; = maxjln x, 1}. 

2 Easy proof for the existence of 

In this section we present a simple argument that shows that the above defined sequence 
is well defined. First we assume that k satisfies gcd{k,q) = 1. By Euler's theorem, we can 
find an integer t such that q t = 1 (mod k), and then define K = 1 + q l + q 2t + • • • + q^" 1 )*. 
Obviously, K = (mod k), and also s q (K) = k. Hence, in this case, K is a Niven number 
whose digits in base q are only O's and l's and whose sum is k. 

If k is not coprime to q, we write k = ab where gcd(6, q) = 1 and a divides q n for some 
n G N. As before, we can find K = (mod b) with s q (K) = b. Let u = max{n, |~log„ K~\ }+l, 
and define K' = (q u + q 2u + • • • + q ua )K. Certainly k = ab is a divisor of K' and s q {K') = 
ab = k. Therefore, is well defined for every k £ N. 

This argument gives a large upper bound, namely of size exp(0(/c 2 )) for a^. 

We remark that if m is the minimal g-Niven number corresponding to k, then q — 1 
must divide m — s q (m) = kc^ — k = (c^ — l)k. This observation turns out to be useful in 
the calculation of ct for small values of k. For instance, in base ten, the following table of 
values of and can be established easily by using the previous simple observation. As 
an example, if k = 17 then 9 has to divide cyj — 1 and so we need only check 10, 19, 28. 
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16 


17 
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Ck 


19 


19 
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19 
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46 
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190 


209 


48 


247 


266 


195 


448 
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198 


874 


3980 


399 


2398 


1679 



3 Elementary bounds for a& in the binary case 

For each positive integer k we set n& = |~log 2 k~\ . Thus, n& is the smallest positive integer 
with k < 2™ fc . Assuming that k £ N (k > 1) is odd, we let t% be the multiplicative order of 
2 modulo k, and so, 2* fc = 1 (mod k). Obviously, i& > n k and t k \ 4>{k), where <p is Euler's 
totient function. Thus, 

nfe < tk < k- 1. (1) 

Lemma 1. For every odd integer k > 1, every integer x £ {0, 1, . . . , fe — 1} can 6e repre- 
sented as a sum modulo k of exactly distinct elements of 

D = {2 i \i = 0,... ,n k + k-2}. 

Proof. We find the required representation in a constructive way. Let us start with an 
example. If x = and A; = 2 nk — 1, then since x = k (mod £;), we notice that in this 
case we have a representation as required by writing k = 1 + 2 + • • • + 2 nfc_1 (note that 
rafc — 1 < n/% + k — 2 is equivalent to k > 1). 

Any j; £ {0, 1, . . . , k — 1} has at most bits of which at most n^ — l are ones. Next, let 
us illustrate the construction when this binary representation of x contains exactly n k — 1 
ones, say 

x = 2 n "~ 1 + 2 nk ' 2 + --- + 2 + l-2 j , for some j £ {0, 1, . . . , n k - 1}. 
First, we assume j < n k — 2. Using 2- ?+1 = 2 J + 2- J = 2 J + 2^ +tk (mod k), we write 
x = 2 j+t k + 2 n k -i + . . . + 2 i+ 2 + 2 i + 2^ + • • • + 1 (mod fc), 

where both j + t k < n^— 2 + /c — 1 = nk + k — 3 and j + ^ > — 1 are true according to ([1]) . 
Therefore all exponents are distinct and they are contained in the required range, which 
gives us a representation of x as a sum of exactly n k different elements of D modulo k. 

If j = njc — 1, then x = 2 rik ~ 1 — 1. We consider x + k instead of x. By the definition 
of rik, we must have k > 2 rik ~ 1 + 1. Hence, x + k > 2 nfe , which implies that the binary 
representation of x+k starts with 2™ fe and it has at most n k ones. Indeed, if s(x+k) > n^ + l, 
then x + k > 2 nk + 2 Tlk ~ 1 + • • • + 2 + 1 = 2 nfe+1 — 1, which in turn contradicts the inequality 



3 



x + k < k — 1 + k = 2k — 1 < 2 nfe+1 — 3 since k is odd. If s(x + k) = rife, then we are 
done (k > 3). If s(x + fc) = — 1, then we proceed as before and observe that this time 
j + tk < n fc + k — 2 for every j 6 {0, 1, 2, . . . , rife — 1} and j + tfe > if j > which is an 
assumption that we can make because in order to obtain rife — 1 ones two of the powers of 
2, out of 1,2,2 2 ,... ,2™ fe ~\ must be missing. 

If s(x + k) < rife — 1, then for every zero in the representation of x + k, which is 
preceded by a one and followed by t (£ > 0) other zeroes, we can fill out the zeros gap in 
the following way. If such a zero is given by the coefficient of 2 J , then we replace 2- 7+1 by 

2 J +2 J ~ 1 H h 2- 7 '-^ + 2-?^ +ifc . This will give ^ + 2 ones instead of a one and i + 1 zeros. We 

fill out all gaps this way with the exception of the gap corresponding to the smallest power 
of 2 and I > 1, where in order to insure the inequality j' + tfe > rife {j 1 = j — £ + 1 > 0) one 

will replace 2-? +1 by 2 j + 2^ H h 2 J '~ m + 2^ £+1+ * fc . The result will be a representation 

in which all the additional powers 2 J ' + * fc will be distinct and the total number of powers of 
two is rife. The maximum exponent of these powers is at most j' + t/~ < + k — 2. 

If the representation of x starts with 2 nfe_1 , then the technique described above can be 
applied directly to x making sure that all zero gaps are completely filled. Otherwise, we 
apply the previous technique to x + k. □ 

Example 2. Let k = 11. Then rin = 4 and t\\ = 10. Suppose that we want to represent 9 
as a sum of A distinct terms modulo 11 from the set D = {1, 2, . . . , 2 13 }. Since 9 = 2 3 + 1, 
we have 9 = 2 2 + 2 + 2 + 1, so 9 = 2 2 + 2 + 2 11 + 1 (mod 11). // we want to represent 
7 = 2 2 + 2 1 + 2° then, since this representation does not contain 2 3 , we look at 7 + 11 = 
18 = 2 4 + 2 = 2 3 + 2 3 + 2 = 2 3 + 2 2 + 2 2 + 2. Thus, 7 = 2 3 + 2 2 + 2 12 + 2 (mod 11). 

We note that the representation given by Lemma [1] is not unique. If this construction is 
applied in such a way that the zero left when appropriate is always the one corresponding 
to the largest power of 2, we will obtain the largest of such representations. In the previous 
example, we can fill out the smallest gap first and leave a zero from the gap corresponding 
to 2 3 , so 7 = 18 = 2 4 + 2 = 2 3 + 2 3 + 1 + 1 = 2 3 + 2 13 + 1 + 2 10 (mod 11). 

Recall that 2 Q ||m means that 2 a \ m but 2 a+1 \ m. We write ^(m) for the exponent a. 

Theorem 3. For all positive integers k and £, there exists a positive integer n having the 
following properties: 

(a) s(nk) = ik, 

(b) n < (2 lk+n >< - 2^( k ))/k. 
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Proof. It is clear that if A; is a power of 2, say k = 2 s , then we can take n = 2 — 1 and 
so s(kn) = s(2 s + 2 s+l + • • • + 2 s+ek ~ 1 ) = Ik. In this case, the upper bound in part (b) is 
sharp since n k = s = (J,2(k). 

Furthermore, if k is of the form k = 2 m d for some positive integers m, d with odd d > 3, 
then assuming that we can find an integer n < (2 2m t- d + n k — l)/d, where n k = \^og 2 d~\ , such 
that s(nd) = 2 m £d, then nk satisfies condition (a) since s(nk) = s(2 m nd) = s(nd) = 2 m id = 
tk. We observe that condition (6) is also satisfied in this case, because (2™+ n * - l)/d = 

Thus, without loss of generality, we may assume in what follows that k > 3 is odd. 

Consider the integer M = 2 ek+nk -1 = 1 + 2 1 H h 2 ek+rik - 1 , and so, s(M) = Ik + n k . 

By Lemma [TJ we can write 

M = 2 jl +2 h + ••• + 2 jn k (mod k), (2) 

where < j\ < ]2 < ■ ■ ■ < jn < k + — 2 < Ik + — 1. Therefore, we may take 

M - (2^ +2^ + ■■■ + 2^k ) 
n = k ' 

which is an integer by (J2J) and satisfies s(nk) = s(M - (2 jl + 2 j ' 2 H h 2 jn k )) = Ik. □ 

Corollary 4. The sequence (a,k)k>i satisfies 

2 k - 1 < a k < 2 k+nk -2^ k \ (3) 

Proof. The first inequality in ^ follows from the fact that if s(a&) = k, then > 1 + 2 + 
• • • + 2 k ~ 1 = 2 k — 1. The second inequality in ([3]) follows from Theorem [3] by taking £ = 1, 
and from the minimality condition in the definition of a&. □ 

We have computed a k and for all k = 1, . . . , 128, 

ci = 1, c 2 = 3, c 3 = 7, . . . , c 20 = 20 9715, 

and the graph of k — > ln(cfc) against the functions A; — > ln(2 fc ) and A — ► ln(2 fc — 1) — ln(A) is 
included in Figure [TJ 

The right hand side of inequality ([3]) is sharp when k = 2 s , as we have already seen. For 
k = 2 s — 1, we get values of c k very close to 2 k — 1 but, in general, numerical evidence shows 
that c k /2 k is closer to zero more often than it is to 1. In fact, we show in Section [6] that this 
is indeed the case at least for odd indices (see Corollary [TTl Corollary [12] and relation ([23]) ) . 
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Figure 1: The graphs of k —> ln(cfc) and k — > ln(2 fc ), k — > ln(2 fc — 1) — ln(k) 

4 Improving binary estimates and some closed formulae 

In order to obtain better bounds for a^, we introduce the following classes of odd integers. 
For a positive integer m we define 

m 

C m = {k=l (mod 2)| 2 k+m -1=^2 2 ji (mod k), for < j x < j 2 < ■ ■ ■ < j m <m + k- 2}. 

z=l 

Let us observe that C m C C m +i- Indeed, if k is in C m , we then have 2 k+m — l = 2 J1 + • • •+2- ?m 
for some < j% < ]2 < ' ' ' < im < m + k — 2. Multiplying by 2 the above congruence and 
adding one to both sides, we get 2 k+m+1 — 1 = 1 + 2 jl+1 + • • • + 2 Jm+1 , representation which 
implies that k belongs to C m +i. Note also that Lemma [T] shows that every odd integer k > 3 
belongs to C u , where u = [log kj log 2] . Hence, we have 2N + 1 = UmeN ^m- 

Theorem 5. For every k € C\, we have 

2 k - 1< a k < 2 k+1 - 1. 

In particular, Ck/2 k — > as k ^> oo through C\. Furthermore, = 2 k+1 — 1 — 2 J1 , where 
ji = Jo + s ifc> wi/i s = [(k — 1 — jo)/tk\ , and < jo < tk — 1 sitc/i that 2 k+l — 1 = 2 J0 
(mod k). 

Proof. We know that 2 k - 1 ^ (mod fc) (see [13 Problem 37, p. 109]). Hence, an integer 
of binary length k whose sum of digits is k is not divisible by k. Therefore, > 2 k — 1. 
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Next, we assume that ak is an integer of binary length k + 1 and sum of digits k; that is, 
ak = 2 k+1 — 1 — 2 J for some j = 0, . . . , k — 1. But 2 k+1 — 1 = x (mod /c), and by hypothesis 
there exists jo such that x = 2 J0 for some jo 6 {0, . . . , t}. — 1}. In order to obtain ak, we 
need to subtract the highest power of 2 possible because of the minimality of ak- So, we 
need to take the greatest exponent j\ = jo + stk < k — 1, leading to s = [(k — 1 — jo)/tk\ ■ 
Hence, a k = 2 k+1 -1-2^. □ 

Based on the above argument, we can compute, for instance, 05 = 55 = 2 6 — 1 — 2 3 , 
since 2 3 - 1 = 2 3 (mod 5). Similarly, a 29 = 2 30 - 1 - 2 5 = 1073741791, since 2 30 - 1 = 2 5 
(mod 29), and a 25 = 2 26 - 1 - 2 19 = 66 5 8 4 5 75, since 2 26 - 1 = 2 19 (mod 25), or perhaps the 
more interesting example 0253 = 2 254 — 1 — 2 242 . 

Theorem 6. If m £ N, and k £ C m +i \ C m , we then have 

2 k+m-l _ 1<ak< 2 k+m _ L 

Thus, Ck/2 k — > as k — > 00 in C m for any fixed m. 

Proof. Similar as the proof of Theorem [5j □ 

Theorem 7. For all integers k = 2 % — 1 > 3, we have 

a k < 2 k+k ~ + 2 k - 2 k ~ { - 1, (4) 

where k~ is the least "positive residue of —k modulo i. Furthermore, the bound is tight 
when k = 2 l — 1 is a Mersenne prime. In this case, we have Ck/2 k — » 1/2 as k — > 00 through 
Mersenne primes, assuming that this set is infinite. 

Proof. For the first claim, we show that the sum of binary digits of the bound of the upper 
bound on Q is exactly k, and also that this number is a multiple of k. From the definition 
of k~ , we find that k + k~ = ia for some positive integer a. Since 

2 k+k- + 2 k _ 2 k-i _ l = 2 k ~ i {2 i - 1) + 2 ia - 1 

= (2 i - l)(2 k ^ + 2 i{ - a ~ l) + 2^ a " 2 ) H + 1), 

we get that 2 k+k + 2 k — 2 k ~ % — 1 is divisible by k. Further, k~ > 1 since k is not divisible 
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by i (see the proof of Theorem [5]) , and 

s ^ 2 fe+fe- + 2 k _ 2 k-i _ A = / 2 fc+*--i + ... + 2 + l + 2 fc -2 k - 

= s{2 k+k '~ x ^ + 2 k + --- + 2 k ~ r U +2+l+2 fc ) 

= s (2 k+k ~ + 2 k ~ 1 + • • • + 2p l + • • • + 2 + l) 

= k, 

where t means that t is missing in that sum. The first claim is proved. 

We now consider a Mersenne prime k = 2 l — 1. First, we show that k £ Ci\ Cj_i. Since 
u = [log kj log 2] = i, by Lemma[IJ we know that k G Cj. Suppose by way of contradiction 
that k G Cj_i. Then 

2 k+i-i _ x = 2 J'i + . . . + 2^-i (mod A;) (5) 

holds with some < ji < j2 < • • • < ji-i < A; + i — 3. Since k is prime, we have that 
2 k ~ 1 = 1 (mod k), and so 2 fe+i ~ 1 - 1 = 2' - 1 = (mod k). 

Because 2 % = 1 (mod A;), we can reduce all powers 2 J of 2 modulo A; to powers with 
exponents less than or equal to i — 1. We get at most i— 1 such terms. But in this case, the 
sum of at least one and at most i — 1 distinct members of the set {1, 2, . . . , 2 4-1 } is positive 
and less than the sum of all of them, which is k. So, the equality ([5]) is impossible. 

To finish the proof, we need to choose the largest representation x = 2 n + • • • + 2 Ji , 
with < ji < j 2 < ■ ■ ■ < ji < k + i - 2, such that 2 k+i - 1 = x (mod k). But 2 k+i - 1 = 
2 l+1 — 1 = 1 (mod A;). Since the exponents j are all distinct, the way to accomplish 
this is to take ji = k + i — 2, = k + i — 3, . . . , j2 = k, and finally j\ to be the 
greatest integer with the property that the resulting x satisfies x = 1 (mod k). Since 
x = 2- 7 ' 1 + 2 k (l + 2 + • • • + 2 i ~ 2 ) = 2^ + 2 k (2 i - 1 - 1) = 2^ + 2* - 2 = 2^ - 1 (mod k), 
we need to have 2 J1 = 2 (mod A:). Since the multiplicative order of 2 modulo k is clearly 
i, we have to take the largest ji = 1 + si such that 1 + < /c. But i must be prime too 
and so 2*" 1 = 1 (mod i). This implies k = 2 l — 1 = 1 (mod i). Therefore ji = k — i. So, 
a fc = 2 fc+i - 1 - x = 2 fc+i - 1 - 2 fc_i - 2 fc+i - 1 + 2 fc = 2 fc+i ~ 1 + 2 fc - 2 fc_i - 1 and the inequality 
given in our statement becomes an equality since k~ = i — 1 in this case. 

Regarding the limit claim, we observe that 

c fc _ k + 1 1 1 1 1 

2* ~ 2A; + k~ k¥ ~ k2^ ' 2' 

as « (and as a result A;) goes to infinity. □ 
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Between the two extremes, Theorems [6] and we find out that the first situation is 
more predominant (see Corollary I12j) . Next, we give quantitative results on the sets C m . 
However we start with a result which shows that C\ is of asymptotic density zero as one 
would less expect. 

5 Ci is of density zero 

Here, we show that C\ is of asymptotic density zero. For the purpose of this section only, 
we omit the index and simply write 

C = {1 < n : 2 n+1 -l = 2 j (mod n) for some j = 1, 2, . . .}. 

It is clear that C contains only odd numbers. Recall that for a positive real number x and 
a set A we put A(x) = A n [1, x]. We prove the following estimate. 

Theorem 8. The estimate 

#C(x) « 



(log log x) 1 / 7 
holds for all x > e e . 

Proof. We let x be large, and put q for the smallest prime exceeding 



1 / log log x 



2 V log log log x 



1/2 



Clearly, for large x the prime q is odd and its size is q = (1 + o(l))y as x — > oo. For 
an odd prime p we write t p for the order of 2 modulo p first defined at the beginning of 
Section [3j Recall that this is the smallest positive integer k such that 2 k = 1 (modp). 
Clearly, t p \ p — 1. We put 

V = {p prime : p = 1 (mod q) and t p \ (p — l)/q}. (6) 

The effective version of Lagarias and Odlyzko of Chebotarev's Density Theorem (see |10j . or 
page 376 in [H]), shows that there exist absolute constants A and B such that the estimate 
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holds for all real numbers t as long as q < i?(logt) 1//8 . In particular, we see that estimate 
(J7|) holds when x > x$ is sufficiently large and uniformly in ( £ [z,x], where we take 
z = exp((loglogx) 100 ). 

We use the above estimate to compute the sum of the reciprocals of the primes p 6 'P(ii), 
where we put u = x 1//10 °. We have 

s= v - = v-+ E - = si + s 2 . 

z — ' p p z — ' p 

p£V(u) pev ^ pev ^ 

p<z z<p<u 

For Si, we only use the fact that every prime p € V is congruent to 1 modulo q. By the 
Brun-Titchmarsh inequality we have 

c ^ 1 „ loglogz logloglogx 

p=l (mod q) 

For S2, we are in the range where estimate (J7J) applies so by Abel's summation formula 



2<p<U 



1 _ /•« _ #V{t) t=u 

t=z 



P J z t t 

P ev y Jz 



+ / i / (t L +oi — . „ ii r// 



g(g-l)t 2 Vexp(Avlogi/g) 



+0\^)+0 



q( q -i)t\ogt WJ \Jz q(q - i)i(iogt) 2 



log loft 11 - log loft ^ / J_\ _ log log x + ^ ! , 



q(q-l) \q 2 J q(q - 1 

In the above estimates, we used the fact that 



logt \(logt) 2 y ' 
as well as the fact that 

+ 

O 



exp{A^ftogt/q) \q 2 (logt) 2 
uniformly for t > z. To summarize, we have that 



log log x + ^ = log log x + / log log a + A = log log x 

9(9 - 1) <? 2 V 9 3 / <? 2 
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We next eliminate a few primes from V defined in (|6|). Namely, we let 

7'i = {p:tp<p 1 /2/(iogp)io }j 

and 

V2 = {p '■ p — I has a divisor d in [p 1//2 /(logp) 10 ,p 1 ^ 2 (logp) 10 ]}. 
A well-known elementary argument (see, for example, Lemma 4 in [2]) shows that 

therefore by the Abel summation formula one gets easily that 

As for V2 , results of Indlekofer and Timofeev from [9] show that 

t log log t 



#V 2 (t) « 



(log t) 1+5 ' 



where 5 = 2 — (1 + log log 2)/ log 2 = 0.08. . ., so again by Abel's summation formula one 
gets that 



, P 



We thus arrive at the conclusion that letting Q = V\{V\ U V2), we have 

s '= £ £ ^ = ^ + 0(1). (10) 



p * — ' p flf 



Now let us go back to the numbers n £ C. Let T>\ be the subset of C(x) consisting of the 
numbers free of primes in Q(u). By the Brun sieve, 

#^ « x n (i- 1 -)=xe W l- y: l+o{ e 3 



peQ(n) v y \ peQ(«) \peQ(it) 

« zexp(-5' + 0(l))<xexp(-^^ 



(log log x) 4+ °( 1 ) (log log X 



« „ . x (11) 
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Assume from now on that n G C(x)\D\. Thus, p \ n for some prime p G Q(u)- Assume that 
p 2 | n for some p G Q{u). Denote by V 2 the subset of such n G C(x)\D\. Keeping p G Q(ii) 
fixed, the number of n < x with the property that p 2 \ n is < x/p 2 . Summing up now over 
all primes p = 1 (mod (?) not exceeding x 1 / 2 , we get that the number of such n < x is at 
most 

#^ 2 < V -«^ — « rn — • (12) 

p £ log g log log x 

p^x 1 ' 2 
p=l (mod q) 

Let T>3 = C(x)\(T>i U T> 2 ). Write n = pm, where p does not divide m. We may also assume 
that n > x/ log a; since there are only at most x/ log x positive integers failing this condition. 
Put t = t p . The definition of C implies that 

2 mp+l = 2 j + 1 ( mod p j 

for some j = 1,2, ... ,i, and since 2 P = 2 (mod p), we get that 2 mp+1 = 2 m+1 (mod p). 
We note that 2 m+1 (mod p) determines m < x/p uniquely modulo t. We estimate the 
number of values that m can take modulo t. Writing X = \2? (mod p)}, we see that #{m 
(mod p)} < I/t, where / is the number of solutions (xi,X2,X2) to the equation 

xi - x 2 - x 3 = 0, xi, x 2 , x 3 G X. (13) 

Indeed, to see that, note that if m and j are such that 2 m+1 = 1 + 2? (mod p), then 
(x x , x 2 , x 3 ) = (2 m+1+ y, 2 V , 2 j+y ) for y = 0, . . . , t - 1, is also a solution of equation {13}, and 
conversely, every solution (xi,X2,X3) = (2 yi , 2 V2 , 2 V3 ) of equation (|13p arises from 2 m+1 = 
1 + 2 J (mod p), where m + 1 = yi — y 2 and j = y% — y 2 , by multiplying it with 2 V2 . 

To estimate I, we use exponential sums. For a complex number z put e(z) = exp(27riz). 
Using the fact that for z G {0, 1, ... ,p — 1} the sum 



l p_1 



a=0 



is 1 if and only if z = and is otherwise, we get 



1 P_1 

j = - ^2 ^2 e ( a ( x ± - x 2 - x 3 )/p). 
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Separating the term for a = 0, we get 



I= { *^+ 1 -Y j £ e(a(x 1 -x 2 -X3)/p) = - + -X:^r 2 a , 
p p p p ' 

a=lii,i2,3;3£A o=l 

where we put T a = X^ex e(axi/p). A result of Heath-Brown and Konyagin [8], says that 
if a 7^ 0, then 



Thus, 



£ 3 



J=fL + 0(t 9 / 8 p 3/4 ), 
P 

leading to the fact that the number of values of m modulo t is 

#{m (mod t)}<-<- + 0(t 1/ V /4 )- 
t p 

Since also m < x/p, it follows that the number of acceptable values for m is 



X ( I/O O I A \ Xt X 



/■1 



pt \p J p 2 tf/^p 1 / 



(note that x/pi > 1 because pt < p < u < x). Hence, 

# p 3< E E ^v? = Ti+r2 - 



For the first sum T\ above, we observe that t < p/q, therefore t/p 2 < l/(pq). Thus, the 
first sum above is 

sr^ x xS' x log log x (logloglogx) 3/2 

Ti < > — < < 5 < x— — — — , (14) 

^jPQ Q <r (log log x) 1 ' 2 



where we used again estimate ()10p . Finally, for the second sum T2, we change the order of 
summation and thus get that 

r 2<^E^s E 374' ( 15 ) 

t>t p€Q(u) F 

t(p)=t 
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where to = to(q) can be taken to be any lower bound on the smallest t = t p that can show 
up. We will talk about it later. For the moment, note that for a fixed t, p is a prime factor 
of 2* — 1. Thus, there are only O(logt) such primes. Furthermore, for each such prime we 
have p > qt. Hence, 



X fogt 

t>t 



' 2 " V: qi/4 S t i),s 



Since p V\ U V2, we get that t p > p 1//2 (logp) 10 . Since p > 2q + 1, we get that t 3> 
q 1 / 2 (\og q) 10 . Thus, for large x we may take to = Q 1 ^ 2 {^ogq) 9 and get an upper bound for 
T 2 . Hence, 

X x - log t x f 00 log s 

^ oV4 ^ t9/8" < oV4 / 1/2/, ^9^978" 



log s 



V4 V S 1 



V a Qoeq)*J < ? l/4+l/16(k)g g )l/8 ^ gS/16(lo. 



x(log log log x) 1/32 
(log logx) '^ 



Combining the bounds (|14|) and (|16p . we get that 

x 

3 (log log x) 1 / 7 ' 

which together with the bounds (fTT|) and (fT2|) completes the proof of the theorem. □ 

Although the density of C\ is zero, one my try to calculate the densities of C m (m > 
1) hoping that they are positive and approach 1 as m — > 00. In the Figure [2] we have 
numerically calculated the density of C2 within the odd integers up to 63201. Nevertheless, 
we abandoned this idea having conjectured that the density of each C m is still zero. However, 
the next section gives a way out to proving that Ck/2 k goes to zero in arithmetic average 
over odd integers k. 



6 The sets C m for large m 

In this section, we prove the following result. 

Theorem 9. Put m(k) = [exp(4000(log log log A;) 3 )J . The set of odd positive integers k 
such that k E C m (/%) is of asymptotic density 1/2. 
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50 100 150 200 250 300 

X 

Figure 2: The graph of 2 * C2 } X \ 1 < x < 63201, x odd. 



In particular, most odd positive integers k belong to C m ^y 
Proof. Let x be large. We put 

y = (log log x) 3 . 

We start by discarding some of the odd positive integers k < x. We start with 

A\ = {k < x : q 2 \ k, or q(q — 1) | k, or q 2 \ cj)(k) for some prime q > y}. 

Clearly, if n E A.\, then there exists some prime q > y such that either q 2 \ n, or q(q — 1) | n, 
or q 2 | p — 1 for some prime factor p of n, or n is a multiple of two primes p\ < P2 such that 
q | Pi — 1 for both i = 1 and 2. The number of integers in the first category is 



X 

7u 



< X 



y<g<x 1 / 2 



„l/2 



dt x 



t 2 y (log log x) 



o(x) 



as x — > oo. Similarly, the number of integers in the second category is 



< 



E 

y<q<x 1 / 2 +l 



.9(9-1) 



El x 



g 2 y (log log x) 3 



o(x) 



15 



as x — > oo. The number of integers in the third category is 



£ E E 

j/<g<a: 1 / 2 P< 



j/<g<a: 1 /2 p< 



V 



E 

y<q<a; 1 / 



p=l (mod q 2 ) p=l (mod </ 2 ) 

log log X 



1/2 ^ 



<C x log log a; ^ 



y<q<x x l 2 



x log log X 



= o(x) 



y (log log x) 2 

as x — > oo, while the number of integers in the fourth and most numerous category is 



< 



E E 



y<q<x 1 / 2 Pl<P2<x 

Pi=l (mod q), i=l,2 



,PlP2 



^ E E 

? /< g <x 1 /2 pi<p 2 <z 

Pi=l (mod q), i=l,2 



\ 



1 1 
< x > - > 

yKqKx 1 / 2 I p<x . 

\p=l (mod g) / 



E 

i/<g<a; 1 / 2 



log log X 



n 1 \2 1 x(loglogx) 

<x(loglogx) 2 2^ — <C v ' 



y<q<x x / 2 



log log X 



o(x) 



as x — ► 00. We now let 

Q = {p:t p <p 1 / 3 }, 

and let A2 be the set of k < x divisible by some q 6 Q with q > y. To estimate #^2, we 
begin by estimating the counting function #Q(t) of Q for positive real numbers t. Clearly, 

2 #SW < Yl q< H (2 s - 1) < 2 E *<* 1 /3 s < 2 i2/3 , 

?€S(t) s<tV3 



so 



#Q(t) < t 2/3 . 
By Abel's summation formula, we now get that 



(17) 



#A 2 < E 



y<q<x 
qeQ 



< X 



E 

y<q<x 

qeQ 



- < X / 

9 A 



x d#Q(i) << x 



V 3 log log X 



o(x) 
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as x — > oo. 

Recall now that P(m) stands for the largest prime factor of the positive integer m. 
Known results from the theory of distribution of smooth numbers show that uniformly for 
3 < s < t, we have 

$(t, s ) = #{m < t : P(m) <s}<^t exp(-u/2), (18) 
where u = log tj logs (see [15j Section III. 4]). Thus, putting 

z = exp (32 (log log log x) 2 ) , 

we conclude that the estimate 

(log log x)° 

holds uniformly for large x once t > z, because in this case u = > ^ log log log x, 
therefore 

u 16 

- > y log log log x, 

so, in particular, u/2 > 5 log log log x holds for all large x. Furthermore, if t > Z = 
exp((loglogx) 2 ), then 

logt (log log x) 2 
log y 3 log log log x ' 

so u/2 > 2 log log x one x is sufficiently large. Thus, in this range, inequality (fT9j) can be 
improved to 

*(*,!/) « T^r-j 7 « 7T^2 • ( 20 ) 

exp(21ogloga;) (logxj^ 

Now for a positive integer m, we put d(m, y) for the largest divisor doim which is y-smooth, 
that is, P(d) < y. Let ^3 be the set of k < x having a prime factor p exceeding z 10 such 
that d(p — 1, y) > p 1 / 10 . To estimate #^3, we fix a y-smooth number <i and a prime p with 
z ±o < p < such that p = 1 (mod cZ), and observe that the number of n < x which are 
multiples of this prime p is < L x /pJ- Note also that d > p 1 / 10 > z. Summing up over all 
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the possibilities for d and p, we get that #^3 does not exceed 



E E 

P{d)<yp=l (mod <f) 



1 v - log log X 

P(d)<yp=l (mod rf) P(d)<!/ 



<C x(loglogx) 2 ^ - <C x(loglogx) 2 y 



2 f x d^(t,y) 



z<d 
P(d)<y 



t 



<c x (log log x) 



2 / *(t,y) 



(log logx) 3 



+ x(log logx)^ 



2 [ x *(t,y)dt 
t 2 ' 



In the above estimates, we used aside from the Abel summation formula and inequality (|19p . 
also the minimal order of the Euler function (j){d)/d 3> 1/ log log x valid for all d £ [l,x]. It 
remains to bound the above integral. For this, we split it at Z and use estimates f)19[) and 
(|20p . In the smaller range, we have that 



V(t,y)dt 1 



z dt log Z 1 
< t, — r- « 



i 2 ' (log log x) 5 ,/^ i (log logx) 5 (log logx) 



In the larger range, we use estimate (|20|) and get 



*(t,y)dt ^ 1 r* dt ^ 1 

z i 2 (logx) 2 7 z i logx' 



Putting these together we get that 



(log log x) 



+ x(log logx) 



+ 



(log logx) 3 logx 



o(x) 



as x — > 00. 



Now let 4 = d(/c,z 10 ). Put 



u; = exp(1920(log log logx) 3 ), 



and put ^4.4 for the set of k < x such that £ > w. Note that each such has a divisor d > w 
such that -P((i) < z 10 . Since for such d we have 

logd 



log(z 10 ) 



6 log log log x, 
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we get that in the range t > w, u/2 > 3 log log log x, for large x, so 



*(M 10 )< n * , 3 ( 21 ) 

(log log x)° 



uniformly for such t once x is large. Furthermore, if 

t > Z\ = exp(12801oglogx(logloglogx) 2 ), 
then u = fog^io > 4 log log x therefore u/2 > 2 log log x. In particular, 



*(*>* 10 ) « 7T7T32 ( 22 ) 



(log x 

in this range. By an argument already used previously, we have that j^At± is at most 



* E [iJ^ E ^«*/ u 



df(t,z 10 ) 



w<d<x w<d<x 
P(d)<z 10 P(d)<z 10 



ION 



+ 

t=w 



x y(t,z 10 )dt 

T 2 



1 + f Zl ^{t,z 10 )dt f x ^(t,z l0 )dt 



(log log x) 3 J w t 2 J Zl t 2 

<.x( 1 + lQ g z i + lQ g j A _ / -j 

\ (log log x) 3 (log log x) 3 (logx) 2 / 

as x — ► oo, where the above integral was estimated by splitting it at Z\ and using estimates 
(|2ip and (|22p for the lower and upper ranges respectively. 



Let be the set of k < x which are coprime to all primes p £ [y,z 10 ]. By the Brun 
method, 



rr / 1\ xlogy 

« x n i - - « — ^ « 

,7 7- V 9/ 1°S z 1°S 1°§ 1°§ x 



as x — > oo. 



We next let Aq be the set of k < x such that P(k) < w 100 . Clearly, 



o(x) 



#A = *(xV°°) = xexp ( -ci - .„ ) = o(x) 



(log log log x) 3 
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as x — > oo, where c\ = 1/384000. 
Finally, we let 

A7 = {k < x : dp | k for some p = 1 (mod d) and p < d 3 }. 

Assume that k £ .A7. Then there is a prime factor p of k and a divisor (i of p — 1 of size 
d > p 1 / 3 such that <ip | k. Fixing d and p, the number of such n < x is < |_x/(dp)J. Thus, 



#^7 < 



E E 

y<p<x d\p—l L " 



< X 



E 

y<P<x 



P 



E 

d>p 1 /3 



« E - 



y<P<x 



-(p - Ij 

1/3 



P 



E 

y<P<x 



Tip ~ 1) 



P 



1+1/3 



i^P 5/4 



y<P<x 



x 



(5/4 yl/4 (log log X) 3 / 4 



o(x) 



as x — > 00. Here, we used r(m) for the number of divisors of the positive integer m and the 
fact that r(m) <C £ m £ holds for all e > (with the choice of e = 1/12). 

From now on, k < x is odd and not in Ui<i<7^- From what we have seen above, most 
odd integers below x have this property. Then £ < w because k A4. Further, k/t is square- 
free because k A\. Moreover, if p \ k/£, then p > z 10 > y, therefore t p > p l l 3 because 
k A2- Since k .A3, we get that d{p — l,y) < p 1 / 10 , so t' p = t v j gcd(t p ,d(p — l,y)) > 
pi/3-1/10 > pi/5 £ or a n p_ Moreover, t' p is divisible only by primes > z > y, so if p\ and 
P2 are distinct primes dividing k/£, then if and t' p2 are coprime because k £ Ai- Finally, 
£ > y because k A5. Furthermore, for large x we have that w > y, so k > £ and in fact 
k/t is divisible by a prime > w wo because k Aq. 

We next put n = lcm[d(<p(k),y),(j)(£)]. We let no stand for the minimal positive integer 
such that no = —k + 1 (mod 4>{£)) and let m = no + £$(£). Note that 

m < 2^>(£) < 2u> 2 = 2exp(3840(logloglogx) 3 ). 

We may also assume that k > x/logx since there are only at most x/logx = o(x) positive 
integers k failing this property. Since k > x/logx, we get that 

m < 2exp(3840(logloglogx) 3 ) < Lexp(4000(logloglog/c) 3 )J = m(k) 
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holds for large x. We will now show that this value for m works. First of all m + k = 

n + £<t>(£) + k = l (mod </>(£)) so 



2 m+k _ 1 = 1 = 2^ _1 + H h 2*W-( n o- 1 ) + 2W-("°- 1 )+ 

+ 2 Xin + • • • + 2 Xtn (mod*), 

where t = £(f>(£) and a?i, . . . ,xt are any nonnegative integers. Let 

jj = 2 m+k _ 1 _ 2 <K*)-1 2 </.W-(n -i) _ 2 0W-("o-l)_ 

Then 



U = Y^^ m (mod*) 



i=l 

for any choice of the integers xi,...,xt. Let p be any prime divisor of k/£. Clearly, 
gcd(tp,n) = d(t p ,y), because t p \ 4>(k) and n A\. In particular, 

t > = b. > h. > pi/3-1/10 > i/6_ 

p gcd(t p ,n) gcd(d(0(fc),2/),2>- 1) 

Let X = \2? n (mod p)}. Certainly, the order of 2 n modulo p is precisely t' p . So, #X = t' > 
pi/5 j± recen ^ result of Bourgain, Glibichuk and Konyagin (see Theorem 5 in [1]), shows 
that there exists a constant T which is absolute such that for all integers A, the equation 

A = 2* in + • • • + 2 Xtn (mod p) 

has an integer solutions < xi, ■ ■ ■ ,Xt < t' once t > T. In fact, for large p the number of 
such solutions 

N(t,p, A) = #{(xi, . . . ,x t ) : < xt, . . . ,x t < t p } 

satisfies 

N(t,p,\) e 



2p ' p 

independently in the parameter A and uniformly in the number t. In particular, if we let 
N\(t,p, A) be the number of such solutions with Xj = Xj for some i 7^ j, then N\(t,p, A) <C 
^^X^/p. Indeed, the pair (i, j) with i 7^ j can be chosen in 0(t 2 ) ways, and the common 
value of Xi — Xj Cclll be chosen in #X ways. Once these two data are chosen, then the 
number of ways of choosing x s € {0, 1, . . . , t' p — 1} with s £ {1, 2, . . . , j} such that 

A _ 2*,n _ gayn _ ^ 2 x a n ( mod p ) 
l<s<t 
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is N(t - 2,p, A - 2 XiJl - 2^ n ) < #X l - 2 /p for t > T + 2. In conclusion, if all solutions 
xi, . . . , Xt have two components equal, then p 1 / 5 <C #X <C t 2 , sopC t 10 . For us, t < 2w 2 , 
so p <C w 20 . Since -P(fc) = P(k/l) > u; 100 , it follows that at least for the largest prime 
p = P(k), we may assume that xi,...,xt are all distinct modulo p for a suitable value of A. 

We apply the above result with A = U , t = 1(f)(1) (note that since t > y, it follows that 
t > T + 2 does indeed hold for large values of x), and write x(p) = (xi(p), . . . , cct(p)) for a 
solution of 

U = 2 Mp)n + ... + 2 xt( P )n (modp), < Xi(p) < ... < X t (p) < ^. 

We also assume that for at least one prime (namely the largest one) the Xj(p)'s are distinct. 
Now choose integers x±, . . . ,xt such that 

Xi = Xi(p) (mod t' p ) 

for all p | This is possible by the Chinese Remainder Lemma since the numbers t' 

are coprime as p varies over the distinct prime factors of k/£. We assume that for each i, 
Xi is the minimal nonnegative integer in the corresponding arithmetic progression modulo 
rip|fc/^p- Further, since nxi(p) are distinct modulo t' p when p = P(k), it follows that nxi 
are also distinct for i = l,...,t. Hence, for such Xj's we have that U — 2^* = i 2 Xi ™ is a 
multiple of all p \ k/l, and since kjl is square-free, we get that U = Y^l=i ^ Xin (mod k/i). 
But the above congruence is also valid modulo £, so it is valid modulo k = lcm[£, k/l], since 
I and k/l are coprime. Thus, 

t 

U = J2 2Xin ( m od k), 

i=l 

or 

t 

2 k + m ~ l _ 1 = 2^W _1 H + 2 < ^~( n °~ 1 ) + 2*( £ )^( n o^ 1 ) + s ^2 Xlfl (mod fe). 

i=i 

As we have said, the numbers Xiii are distinct and they can be chosen of sizes at most 
nlcm[tp : p \ kl] < <j){k) < k. Finally, nxi are divisible by (f)(1) whereas none of the numbers 
4>(l) — j for j = 1, . . . , no — 1 is unless no = 1. Thus, assuming that no ^ 1, we get that all 
the m = t + n,Q exponents are distinct except for the fact that (f)(1) — (uq — 1) appears twice. 
Let us first justify that no 7^ 1. Recalling the definition of no, we get that if this were so then 
4>(£) j k. However, we have just said that I has a prime factor p > y. If (f>(l) \ k, then k is 
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divisible by both p and p—1 for some p > y and this is impossible since n A\. Finally, to 
deal with the repetition of the exponent 4>{£) — (no — 1), we replace this by 4>(£) — (no — 1) +t k , 
where as usual tk is the order of 2 modulo n. We show that with this replacement, all the 
exponents are distinct. Indeed, this replacement will not change the value of 2 < ^) - ( no-1 )+* fc 
(mod k). Assume that after this replacement, <p(£) — (no — 1) +t k is still one of the remaining 
exponents. If it has become a multiple of n, it follows that it is in particular divisible by t p for 
all primes p \ t. Since t p \ t k and t p \ 4>(£) for all primes p \ £, we get that t p | no — 1, so t p \ k. 
Since £ is divisible by some prime p > y (because k As), we get that t p \ k. Since k A2, 
we get that t p > p 1 ^ 3 . Thus, k is divisible by a prime p > y and a divisor d of p — 1 with 
d > p 1 / 3 , and this is false since n A7. Hence, this is impossible, so it must be the case that 
(/)(£)- (n Q -l)+t k £ {<p(£)-l, . . . ,(p(£)-(n -l)}. This shows that t k < n <£<j){£) < 2 w w 20 . 
However, t k is a multiple of tp^ > P(k) 1 / S , showing that P(k) < 2 S0 w G0 , which is false 
for large x since k Aq. Thus, the new exponents are all distinct for our values of k. As 
far as their sizes go, note that since k has at least two odd prime factors, it follows that 
t k I (f)(k)/2, therefore <j)(l) - (n - 1) + t k < w + <j)(k)/2 < w + k/2 < k since k > 2w for 
large x. Thus, we have obtained a representation of 2 k+m — 1 modulo m of the form 

2 jl + ■■■ + 2 jm (mod k) 

where < j% < . . . < j m < k, which shows that k £ C m . Since m < m(k) and C m C C m ( fc ), 
the conclusion follows. □ 

Remark 10. The above proof shows that in fact the number of odd k < x such that k C m (k) 
is 0(x/logloglogx). 

Corollary 11. For large x, the inequality c k /2 k < 2 m ( fc )-(l°gk)/(l°g2) holds for all odd k < x 
with at most <9(x/ log log log 2;) exceptions. 

Proof. This follows from the fact that c k = a k /k < 2 k+m /k, where k G C m (see Theorem 
[6J, together with above Theorem [9] and Remark 1101 □ 

Corollary 12. The estimate 



I y c ± = o{ - 

x ^— ' 2 k \ log log log x 

k odd 



holds for all x. 
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Proof. If k < x/logx is odd, then cj~/2 k < 1, so 



E 



Cfc „ x 



< 



2 k log x 

k<x/logx 
k odd 

If /c G [x/logx,x] but A; C m ^, then still c^/2 fc < 1 and, by the Corollary [TTl the number 
of such /c's is 0(x/logloglogx). Thus, 



E S« 



2 fe log log log x 

k£[x/logx,x] 
k odd 



For the remaining odd values of k < x, we have that 

ffc < 2 m(fc)-(logfc)/(log2) 
2 fe - ' 

so it suffices to show that 

1 



2 m(fc)-(logfc)/(log2) 



< 



log log log X 
is equivalent to 

(log /c)/(log2) — m(k) > log log log log xj log 2, 
which in turn is implied by 

log(x/logx) — log log log log x > (log2)exp(4000(loglogloga;) 3 ), 
and this is certainly true for large x. Thus, indeed, 

°k „ ( x 



E i = o 



2 k \ log log log x 

l<k<x \ to to to 

k odd 

which is what we wanted to prove. □ 
In particular, 



\ E % = oiD (23) 



x ^ 2 

k<x 
k odd 

as x — ► co. One can adapt these techniques to obtain that the whole sequence Cfc/2 fc is 
convergent to in arithmetic average. In order to do so, the sets C m should be suitably 
modified and an analog of Theorem [9] for these new sets should be proved. We leave this 
for a subsequent work. 
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7 Existence and bounds for at in base q > 2 



Let q > 2 be a fixed integer and let x be a positive real number. Put 

Vk(x) = {0 < n < x : s q (n) = k}, 
Vk(x;h,m) = {0 < n < x : s q {n) = k, n = h (mod m)}. 

Mauduit and Sarkozy proved in [12J that if gcd(m, q(q — 1)) = 1, then there exists some 
constant cq depending on q such that if we put 

i = min {k, (q — 1) [log x j log q\ — k} , 

then Vfc(x) is well distributed in residues classes modulo m provided that m < exp(co^ 1//2 ). 
Taking m = k and h = 0, we deduce that if k < exp(co^ 1//2 ), then 

V k (x;0,k) = (l + o(l))V k (x)/k 

as x — > oo uniformly in our range for k. The condition on k is equivalent to log/c <C l 1 ^ 2 , 
which is implied by k + 0((log k) 2 ) <C logx. Thus, we have the following result. 

Lemma 13. Let q > 2 be fixed. There exists a constant c\ such that if k is any positive 
integer with gcd(k,q(q — 1)) = 1, then Vfc(x) is well distributed in arithmetic progressions 
of modulus k whenever x > exp(cifc). 



Corollary 2 of [12j implies that if 

i-l 



21og<? 



log x — k 



o(logx) as x — ► oo, (24) 



then the estimate 



A 2 / A 3 
#V k {x) = - '"■ 1/9 exp ( -c 3 - + O + 



(logx) 1 / 2 \ logx \(logx) 2 (logx) 1 / 2 

holds with some explicit constant C3 depending on q. As a corollary of this result, we deduce 
the following result. 

Lemma 14. If condition (|24"|) is satisfied, then V k {x) / 0. 

In case k and g are coprime but k and g — 1 are not, we may apply instead Theorem B 
of [11] with m = k and h = to arrive at a similar result. 
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Lemma 15. Assume that q > 2 is fixed. There exists a constant C4 depending only on q 
such that ifk is a positive integer with gcd(/c, q) = 1, and x > exp(c4/c), then Vk(x; 0, k) 7^ 0. 

One can even remove the coprimality condition on q and k. Assume that x is sufficiently 
large such that 

A < c 5 (logx) 5/8 , (25) 

where C5 is some suitable constant depending on q. Using Theorem C and Lemma 5 of 
with m = k and h = 0, we obtain the following result. 

Lemma 16. Assume that both estimates (|25p and k < 2( lo s :c ) 1 ' /4 hold. Then Vk(x; 0, k) 7^ 0. 

A sufficient condition on x for Lemma [TBI above to hold is that x > exp(cQk), where c% is 
a constant is a constant that depends on q. Putting Lemmas [T5l and [TBI together we obtain 
the next theorem. 

Theorem 17. For all q > 2 there exists a constant cq depending on q such that for all 
k > 1 there exists n < exp(c$k) with s q (kn) = k. 

Consequently, = exp(0(fc)) for all k, and in particular it is nonzero. The following 
example of Lemma [TH1 shows that = exp(o(fc)) does not always hold as k — > 00. 

Lemma 18. If q > 2, then 

a qm = q m I 2qT=r _ 1 j . 
Ifq = 2, then a 2 ™ = 2 m (2 2m - 1). 

Proof. The fact that s q (a q m) = q m for all q > 2 is immediate. We now show the minimality 
of the given a q m with this property. Let a m = (q m — l)/(q — 1). Note that every digit of 
q a ™ — 1 in base q is maximal, so q° m — 1 is minimal such that s q {q am — 1) = q m — 1. Since 

g*™ - 1 = (q - l)^™- 1 + ( q - l)q a ™~ 2 + ■ ■ ■ + (q - 1), 

then a g ™ must contain the least term q t , where t > a m — 1 such that its sum of digits is 
q m and q m \a q m. The least term is obviously q am , and it just happens that a q m such defined 
satisfies the mentioned conditions. □ 
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